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Monoclonal antibodies (mAbs) are highly complex proteins that display a wide range of microheterogeneity that requires 
multiple analytical methods for full structure assessment and quality control. As a consequence, the characterization 
of mAbs on different levels is particularly product - and time - consuming. This work presents the characterization of 
trastuzumab sequence using sheathless capillary electrophoresis (referred as CESI) - tandem mass spectrometry (CESI- 
MS/MS). Using this bottom-up proteomic-like approach, CESI-MS/MS provided 100% sequence coverage for both heavy 
and light chain via peptide fragment fingerprinting (PFF) identification. The result was accomplished in a single shot, 
corresponding to the analysis of 100 fmoles of digest. The same analysis also enabled precise characterization of the 
post-translational hot spots of trastuzumab, used as a representative widely marketed therapeutic mAb, including the 
structural confirmation of the five major N-glycoforms. 



Introduction 

Since 1986 and the approbation of muromonab-CD3 by the US 
Food and Drug Administration (FDA), monoclonal antibodies 
(mAbs) have taken a major market share in the pharmaceutical 
industry and their development is constantly increasing.''' MAbs 
are highly complex glycoproteins potentially displaying many 
naturally-occurring molecular micro-heterogeneities and those 
that are introduced through imperfect processing, physico-chem- 
ical and enzymatic changes that occur during their production 
and long-term conservation. As a consequence, there is a con- 
tinuous need for improvement of analytical methods to enable 
fast and accurate characterization. Mass spectrometry (MS) 
plays a major role in the characterization of therapeutic mAbs 
because it provides specificity, sensitivity and may also provide 
structural information. Characterization using MS has helped 
product and process development^ and batch consistency assess- 
ment.'*'^ Furthermore, recent raising importance of biosimilar 
and biobetter mAbs represents a major stake for characterization 



methodologies. Biosimilar mAbs are produced from different 
clones and processes.^ The European Medicines Agency (EMA) 
and the FDA have published draft guidelines concerning the 
approval path-way of biosimilar mAbs. These documents discuss 
the physicochemical characterization steps required to assess bio- 
similarity. For example, EMA guidelines suggest that the bio- 
similar candidate should have the same amino acid sequence as 
the reference mAb and a comparable glycosylation profile.^ The 
selection of a highly similar candidate to the innovator product 
is mandatory before performing pre-clinical and clinical biosimi- 
larity studies. Obtaining suitable structural information rapidly, 
■which allo^ws quick elimination of candidates dissimilar to the 
reference protein, is therefore crucial. The methods used should 
cover a -wide range of structural information, e.g., amino acid 
sequence, disulfide bonds, glycosylation, posttranslational modi- 
fications (PTM).** Additional to MS, mAbs characterization 
requires a panel of orthogonal methods to cover a full range of 
structural information,'' including reverse phase liquid chroma- 
tography (RP-LC),'°''' ion exchange chromatography (lEC), size 
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exclusion chromatography (SEC),'' capillary isoelectric focusing 
(cIEF)''' or micellar electrokinetic chromatography (MEKC)." '^ 

Peptide mapping of mAbs by MS is commonly performed 
using a bottom-up proteomic approach because it allows acqui- 
sition of information on the amino acid sequence and PTMs. 
In this approach, the protein undergoes enzymatic digestion, 
resulting in a peptide digest most often separated by liquid chro- 
matography and detected by UV spectrophotometry and MS. 

A novel sheathless CE-ESI-MS platform, referred as CESI, 
allowing hyphenation of capillary electrophoresis (CE) to elec- 
trospray ionization mass spectrometry (ESI-MS) has been devel- 
oped by Beckman Coulter based on a design previously described 
by Moini.'^ This CESI prototype (also known as sheathless cap- 
illary electrophoresis) uses a bare fused silica capillary whose 
outlet has been etched using hydrofluoric acid, which makes it 
porous to the electrical transport of small ions without permit- 
ting significant matter transfer through the pores. The porous 
tip provides electrical contact via a second capillary. Detailed 
description of this interface has been given by Haselberg et al.'* 
The sheathless interface has already been used to perform suc- 
cessful CE-ESI-MS experiments in proteomics,"'^" metabolo- 
mics,"' intact proteins'^'^' demonstrating drastically increased 
sensitivity compared with sheath-liquid CE-MS due to operating 
flow rates of well below 100 nL/min. Whitmore and Gennaro 
recently used the CESI prototype to performed the peptide mass 
fingerprinting (PMF) of a mAb. They demonstrated the success- 
ful use of PMF to cover the amino sequence of a mAb, including 
the hydrophilic peptides that are eluted in the void volume in 
RP-LC-MS." 

In this work, we used the CESI-MS system to develop a 
CE-MS/MS method that allowed the fast and precise charac- 
terization of a mAb digest. The mAb selected was trastuzumab 
which was approved by FDA and EMA, in 1998 and 2000 
respectively. Trastuzumab has been intensively studied and can 
be considered as a highly representative therapeutic mAb.^'' It 
is a humanized immunoglobulin gamma 1 (HzIgG-1) directed 
against the HER2/neu receptor which is overexpressed in about 
20-30% of invasive breast cancer patients. Trastuzumab is 
composed of two heavy chains (HC) having 449 amino acids 
and two light chains (LC) having 214 amino acids. The HC 
has an N-glycosylation consensus site on its asparagine (Asn) in 
position 300, and the glycoprotein has a total molecular mass 
of about 148 kDa.'' A tryptic digest of trastuzumab was pre- 
pared using a conventional in-solution digestion protocol."** The 
digest was then studied at different levels. First a peptide map- 
ping of the mAb was performed using a CESI-MS/MS analysis. 
The MS/MS spectra were automatically analyzed to identify the 
peptides using a search engine widely used in proteomic stud- 
ies in order to hasten peptide identifications. PTMs were also 
characterized using CESI-MS/MS, particularly those known as 
degradation hot spots. These PTMs were identified by Mascot 
search algorithm and manually confirmed. At a final level, gly- 
cosylation were also investigated using CESI-MS/MS. Although 
glycosylation is a type of PTM, they are treated in this article 
separately from amino acid PTMs because they are not degrada- 
tion hot spots. The glycans could be located on the amino acid 



sequence and the structure of the major glycoforms could be 
deduced from MS/MS data. Remarkably, the characterization of 
the mAb over these different described levels was realized using 
only one CESI-MS/MS analysis of the digest. 

Results 

Peptide mapping of trastuzumab using CESI-MS/MS. Peptide 

mapping is commonly used to determine protein amino acid 
sequence and to locate or quantify PTMs. This methodology is 
quite important in early development of therapeutic antibodies 
as well as during long-term life cycle management of the bio- 
pharmaceutical products. As part of this work, one of the objec- 
tive was to investigate the peptide mapping of trastuzumab by 
CESI-MS/MS. Tandem MS data interpretation and peptide 
identification was done automatically using Mascot to obtain fast 
and accurate data treatment. 

A sample of trastuzumab was digested with trypsin enzyme. 
Tryptic digestion was performed using an in-solution digestion 
protocol applied on a routine basis in the laboratory. It includes 
reduction and alkylation of disulfide bonds. Prior to digestion, 
an unfolding step was performed using guanidine as a chaotropic 
agent. Prior to MS analysis, peptides were separated using the 
CESI prototype which allows direct coupling of CE with the 
nanospray source of the MS without using an additional liquid 
like in sheath-liquid CE-MS. ^' 

Figure 1 illustrates an example of the separation obtained for 
the tryptic digest of trastuzumab for 100 fmol of injected pro- 
tein. The total duration of the separation was less than 50 min 
and the resulting electropherogram showed that all peptides were 
detected between 19 and 47 min. Separation was performed at a 
voltage of +20 kV to maintain a balance between peptide migra- 
tion speed and separation efficiency. 

Digested peptides were identified from MS/MS data acquired 
using a Mascot search algorithm, although 100% sequence cov- 
erage of a mAb has already been reported using the prototype 
CESI-MS platform.^' In this work, peptide identification was 
automated and based on MS/MS data, meaning that identifi- 
cation was made on precursor ion mass measure and fragment 
identification. Results of Mascot search showed sequence cover- 
age of 100% for both the HC and LC of the trastuzumab. The 
detailed sequence coverages for both chains are shown in Figure 
2. Some peptides were selected and fragmented several times 
during the analysis leading to the same identification. Similarly, 
missed cleavages could be promptly detected for several peptides 
reinforcing the confidence of the identification by enabling pep- 
tide overlapping. 

Tryptic digestion produces a heterogeneous mixture of pep- 
tides, especially with regard to their number of amino acids. 
Results show the identification during the analysis of small pep- 
tides having 4-5 amino acids such as HT05 or LT09 (Fig. 3) 
as well as a 63 amino acids peptide having a molecular mass of 
6715.26 Da (HT21). During its elution, the ion corresponding 
to HT21 peptide was selected several time for fragmentation and 
resulted in the same identification, which reinforced the confi- 
dence of this important identification. 
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Figure 1. Base Peak Electropherogram corresponding to the analysis by CESI-MS/MS of trastuzumab tryptic digest. Experimental conditions: bare 
fused silica capillary with porous tip, total length 95 cm (30 ixm i.d., 150 |j,m o.d.); Cf voltage +20 kV; BGE 10% acetic acid; sample trastuzumab tryptic 
digest 2.5 mg/mL in BGE (11 nL injected). MS capillary voltage: -1.3 kV, m/z range: 50-3000. 



EVQLVESGGG LVQPGGSLRL SCAASGFNIK DTYIHWVRQA PGKGLEWVAR DIQMTQSPSS LSASVGDRVT ITCRASQDVN TAVAWYQQKP GKAPKLLIYS 



lYPTNGYTRY ADSVKGRFTI SADTSKNTAY LQMNSLRAED TAVYYCSRWG ASFLYSGVPS RFSGSRSGTD FTLTISSLQP EDFATYYCQQ HYTTPPTFGQ 



GDGFYAMDYW GQGTLVTVSS ASTKGPSVFP LAPSSKSTSG GTAALGCLVK GTKVEIKRTV AAPSVFIFPP SDEQLKSGTA SVVCLLNNFY PREAKVQWKV 



DYFPEPVTVS WNSGALTSGV HTFPAVLQSS GLYSLSSVVT VPSSSLGTQT DNALQSGNSQ ESVTEQDSKD STYSLSSTLT LSKADYEKHK VYACEVTHQG 



YICNVNHKPS NTKVDKKVEP KSCDKTHTCP PCPAPELLGG PSVFLFPPKP LSSPVTKSFN RGEC 



KDTLMISRTP EVTCVWDVS HEDPEVKFNW YVDGVEVHNA KTKPREEQYN 



STYRWSVLT VLHQDWLNGK EYKCKVSNKA LPAPIEKTIS KAKGQPREPQ 



VYTLPPSREE MTKNQVSLTC LVKGFYPSDI AVEWESNGQP ENNYIdTPPV 



LDSDGSFFLY SKLTVDKSRW QQGNVFSCSV MHEALHNHYT QKSLSLSPG 



Figure 2. Sequence coverage obtained by CESI-MS/MS for trastuzumab HC (left-hand side) and LC (right-hand side) based on the identified peptides. 
Experimental conditions: see materials and methods. 



It should be noted that such heavy peptides usually can attain detected. Fragmentation of the peptides enables an important 

higher charge state; therefore, the search algorithm must con- part of the amino acid sequence to be retraced reinforcing the 

sider high charge state as well. In the CESI-MS/MS analysis high confidence in peptide identification, 
of the mAb, peptides having charge state up to 6+ could be 
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Position 
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HT01 


EVQLVESGGGLVQPGGSLR 


1-19 


941,501 


LT01 


DIQMTQSPSSLSASVGDR 


1-18 


626,963 


HT02 


LSCAASGFNIKDTYIHWVR 


20-38 


746.703 


LT02 


DIQMTQSPSSLSASVGDRVTITGR 


1-24 


870 42 


HT03 


OAPl^Kl^l FWVAR 

ij(nriJir\lJILCVV Vrtn 


39-50 


656.356 


LT03 


ASQDVNTAVAWYQQKPGK 


25-42 
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HT04 


lYPTMf^YTR 
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51-59 


543265 


LT04 


LLIYSASFLYSGVPSR 


45-61 


77Q 

OOD, lis 


HT05 


1 nujv i\ 


60-65 


341.670 


LT05 


LLIYSASFLYSGVPSRFSGSR 


46-66 


/ OH, / 00 


HT06 


1 nUO V i\U n 


60-67 


448229 


Tnfi 

L 1 UD 


SGTDFTLTISSLQPEDFATYYCQQHYTTPPTFGQGTK 


67-103 


1 r\A7 QflQ 


HT07 


YAn<5\/Kr!RFTI'^AnT<5K' 
1 rtuo V r\unr i lOrtu i or\ 


60-76 


615.976 


LIU/ 


SGTDFTLTISSLQPEDFATYYGQQHYTTPPTFGQGTKVEIK 67-1 07 


1 103,350 


HT08 


YAr)'5\/k'(^RFTI<?AnTQk'MTAVI OMMQI R 


60-87 


785.391 


L 1 Uo 


SGTDFTLTISSLQPEDFATYYGQQHYTTPPTFGQGTKVEIKR 67-1 08 


300, ooy 


HT09 




66-75 


691.802 


LT09 


VEIKR 


104-108 


322 703 


HT10 


RRFTI'^AnT'^KMTAYI OMMCll R 


65-87 


825.411 


LT10 


RTVAAPSVFIFPPSDEQLKSGTASWCLLNNFYPR 


108-142 


971 253 


HT11 


GRFTISADTSKNTAYLQMNSLRAEDTAVYYCSR 


65-98 


948442 


LT11 


TVAAPSVFIFPPSDEQLK 


109-126 


973,513 


HT12 


FTISADTSKNTAYLQMNSLR 


68-87 


754.377 


LT12 


TVAAPSVFIFPPSDEQLKSGTASVVCLLNNFYPR 


109-142 


745,982 


HT13 


NTAYLQMNSLR 


77-87 


655.827 


LT13 


TVAAPSVFIFPPSDEQLKSGTASVVGLLNNFYPREAK 


109-145 


811,817 


HT14 


NTAYLQMNSLRAEDTAVYYCSR 


77-98 


876066 


LT14 


VQWKVDNALQSGNSQESVTEQDSK 


146-169 


893,089 


HT16 


AEDTAVYYCSR 


88-98 


667784 


LT15 


VQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLTLSK 146-183 


833,201 


HT16 


WGGDGFYAMDYWGQGTLVTVSSASTK 


99-124 


929089 


LT16 


VDNALOSGNSQESVTEQDSK 


160-169 


712,656 


HT17 


WGGDGFYAMDYWGQGTLVTVSSASTKGPSVFPLAPSSK 99- 1 36 


988975 


LT17 


VDNALQSGNSQESVTEQDSKDSTYSLSSTLTLSK 


150-183 


905,928 


HT18 


GPSVFPLAPSSK 


125-136 


593822 


LT18 


VDNALQSGNSQESVTEQDSKDSTYSLSSTLTLSKADYEK 


150-188 


1057,488 


HT19 


GPSVFPLAPSSKSTSGGTAALGCLVK 


126-150 


830436 


LT19 


DSTYSLSSTLTLSK 


170-183 


751,879 


HT20 


STSGGTAALGCLVK 


137-150 


661.34 


LT20 


ADYEKHKVYACEVTHQGLSSPVTK 


184-207 


916,441 


HT21 


DYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSSWWPS 
SSLGTQTYIGNVNHKPSNTK 


151-213 


1343.868 


LT21 
LT22 


HKVYACEVTHQGLSSPVTK 
HKVYACEVTHQGLSSPVTKSFNRGEC 


189-207 
191-214 


714,354 
748,847 


HT22 


DYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSSVVTVPS 


151-216 


1177.085 


LT23 


SFNRGEC 


208-214 


435,178 


SSLGTQTYICNVNHKPSNTKVDK 










HT23 


VDKKVEP 


214-221 


471.778 










HT24 


SCDKTHTCPPCPAPELLGGPSVFLFPPKPK 


222-251 


834.404 










HT25 


SGDKTHTGPPCPAPELLGGPSVFLFPPKPKDTLMISR 


222-258 


692.675 










HT26 


TPEVTCVVVDVSHEDPEVK 


259-277 


713676 










HT27 


FNWYVDGVEVHNAK 


278-291 


659932 










HT28 


FNWYVDGVEVHNAKTKPR 


278-295 


540771 










HT29 


TKPREEQYNSTYR 


292-304 


557934 










HT30 


TKPREEQYNSTYRWSVLTVLHQDWLNGKEYK 


292-323 


777198 










HT31 


VVSVLTVLHQDWLNGKEYK 


305-323 


743.4 










HT32 


VVSVLTVLHQDWLNGKEYKCK 


306-325 


83944 










HT33 


VVSVLTVLHQDWLNGKEYKCKVSNK 


305-329 


737139 










HT34 


CKVSNKALPAPIEK 


324-337 


518953 










HT35 


VSNKALPAPIEK 


326-337 


422914 










HT36 


TISKAKGQPREPQVYTLPPSR 


338-358 


589.073 










HT37 


AKGQPREPQVYTLPPSREEMTK 


342-363 


636327 










HT38 


GQPREPQWTLPPSREEMTK 


344-363 


781.722 










HT39 


GQPREPQWTLPPSREEMTKNQVSLTCLVK 


344-373 


872.441 










HT40 


NQVSLTCLVK 


364-373 


581.315 










HT41 


GFYPSDIAVEWESNGQPENNYK 


374-395 


848712 










HT42 


GFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSK 374-412 


1467.351 










HT43 


GFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDK 374-417 


1240.085 










HT44 


TTPPVLDSDGSFFLYSK 


396-412 


626.312 










HT45 


TTPPVLDSDGSFFLYSKL1VDK 


396-419 


81075 










HT46 


LTVDKSR 


413-419 


409.735 










HT47 


LTVDKSRWQQGNVFSCSVMHEALHNHYTQK 


413-442 


901.175 










HT48 


SRWQQGNVFSCSVMHEALHNHYTQK 


418-442 


609.877 










HT49 


GQPREPQVYTLPPSR 


344-358 


1723.910 










HT50 


GQPREPQWTLPPSREEMTK 


344-363 


2342.179 










HT51 


EPQVYTLPPSREEMTK 


348-363 


1903.943 










HT52 


EPQVYTLPPSREEMTKNQVSLTCLVK 


348-373 


3046.555 










HT53 


EEMTKNQVSLTCLVK 


359-373 


1778.900 










HT64 


EEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYK 


359-395 


4304.050 










HT55 


WQQGNVFSCSVMHEALHNHYTQK 


420-442 


934.752 










HT56 


SLSLSPG 


443-449 


660365 











Figure 3. List of trastuzumab digested peptides identified by the CESI-MS/MS analysis. Experimental conditions: see material and methods. 



As seen in Figure 4, the MS/MS spectra quality, has allowed 
the amino acids sequence composing the variable domain of the 
mAb HC for 109 of 120 amino acids to be retraced. Similarly 
98 of the 107 amino acids sequence composing the variable 
domain of the LC could be retraced using the MS /MS data. 



Thus, in addition to allowing the identification of the digested 
peptides of the mAb, the MS/MS spectra enabled characteriza- 
tion, without ambiguity, of most amino acids composing such a 
sensitive part of the mAb, i.e., the variable domain. 
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Figure 4. MS/MS spectra of peptide HT01 and HT02 illustrating the amino acid sequence retracing in the variable domain. Experimental conditions: 
see material and methods. 
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Figure 5. MS/MS spectra of peptide HT04 (lYPTNGYTR) (A) without modification and (B) with a deamidation on Asn^' illustrating the partial modifica- 
tion of the mAb and the characterization of both forms by CESI-MS/MS. Experimental conditions: see Materials and Methods. 



To evaluate the robustness of the method, the CESI-MS/MS 
analysis of the mAb digest was repeated six times consecutively. 
Results of these analysis demonstrated 100% sequence coverage 
for each run and the same detected peptides in each analysis. 

Characterization of amino acids PTMs hot-spots. Mabs are 
heterogeneous by nature because of structural modifications that 
the proteins may undergo during their lifetime. Those modifica- 
tions may be Critical Quality Attributes (CQA) because they can 
modify the conformation of the protein and therefore influence 
the antibody activity. Posttranslational modifications and other 
physico-chemical modifications can be induced by intra or extra 
cellular processes, buffer, purification process, storage or stress 
conditions.^''^^ Among those PTMs, several referred to as "hot 
spots" must be carefully monitored to determine the stability of 
the mAb during pharmaceutical development. Known degrada- 
tion hot spots potentially modified on the HC are cyclization 
of N-terminal glutamic acid (Glu) into pyroglutamic acid and 
deamidation of Asn'' and Asn'**^ but also oxidation of Met''^ 
and Met''^'. The main hot spot on the LC is the deamidation 
Asp'°. For the characterization of trastuzumab PTMs, the same 
CESI-MS/MS data presented in the previous section to obtain 
the mAb peptide mapping was used. In this case again, the 



PTMs were automatically identified by the Mascot search algo- 
rithm. Mass spectra were further manually studied to confirm 
the presence of the modifications. 

Results obtained showed glutamine cyclization of the 
N-terminal extremity of the HC. Also, the same peptide without 
the modification was identified, which suggests a partial modifi- 
cation of the protein. Use of the extracted ion electropherograms 
(XIE) for m/z ratio of 932.500 (2+) and 941.501 (2+) correspond- 
ing to the peptide HTOl with both N-terminal extremity forms 
allowed the determination of a migration time of 34.5 min for 
the intact form of HTOl and 45.6 min for the modified HTOl. 
This charge difference influences the electrophoretic mobility of 
the modified peptide, leading to its separation. 

In the same manner, several deamidation hot spots were also 
identified. Results presented in Figure 5 show the correct identi- 
fication of the deamidation of Asn'' (HT04). In this case again 
the parent peptide could be identified without the modifica- 
tion as well, and the fragmentation allowed confirmation of the 
deamidation of this specific amino acid. Another deamidation 
hot spot present on the HC on Asn^"*^ (HT41) was also charac- 
terized with the method. In this case again, the MS/MS spectra 
obtained for HT41 allowed the identification of the deamidation 
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on this specific amino acid which is crucial because the digested 
peptide is composed of several Asn. Other micro-variants, such 
as methionine (Met) oxidation hot spots, were also identified 
by the algorithm using the CESI-MS/MS data. Results allowed 
the identification of the oxidations of Met^^' (HT25) and Met^^' 
(HT48) as a confirmation of the trend of a partial modification/ 
degradation; both peptides could also be observed without any 
oxidation. The peptide HT25 was detected at a time of 23.8 min 
for the intact form and 24.0 min for the oxidized HT25. A simi- 
lar results could be observed for peptide HT48 as the intact pep- 
tide could be detected at 25.2 min and respectively at a time of 
25.5 min for the oxidized HT25. Concerning the LC, the main 
deamidation hot spot located on Asn^° (LT03) was correctly iden- 
tified by the algorithm. As for the other hot spots, the manual 
confirmation suggested a partial modification/degradation of the 
protein. 

These results unambiguously demonstrate the possibility of 
using CESI-MS to perform simultaneous MS/MS peptide map- 
ping and amino acid PTM characterization. The separation of 
the digested peptide mixture allowed the MS to fragment a large 
number of ions, which enables the detection of PTMs. The mass 
spectra quality obtained using the CESI interface allowed the 
algorithm to successfully identify each studied modification. 

Characterization of major N-glycoforms. For most IgGs, 
glycosylation represents only 2 to 3% of the total mass, but it 
can significantly influence the protein's structure, immunogenic- 
ity and stability. Glycosylation is also a class of mAb PTMs: 
here their study was described separately from amino acid PTMs 
because they are not considered degradation hot spots. To per- 
form an advanced characterization of trastuzumab, an estima- 
tion of the ratio of the main glycoforms, as well as a structural 
determination is necessary. Trastuzumab is N-glycosylated on 
the Asn^"° in the CH2 region. GOF and GIF, the major gly- 
coforms found on human and recombinant IgG produced in 
CHO, NSO and SP2/0 cells, represent more than 80% of the 
glycoforms.'" '^ Considering that Mascot search algorithm is only 
intended to identify the peptide sequences it was necessary to 
manually study the CESI-MS/MS data to identify the glycosyl- 
ations and determine their structures. Interestingly, in this case 
as well, no extra experiments needed to be performed to focus on 
glycopeptides. The same data sets used for peptide mapping and 
hot spots characterization were also used here. 

In conventional protocols, a mAb is commonly treated with 
PNGase F before analysis to release the glycans and allow study of 
them while not attached to the peptide backbone.'^ In this work, 
the mAb was digested by trypsin only, and therefore we expected 
to detect glycans still linked to the corresponding peptide back- 
bone of the HC. The extracted ion electropherograms (XIE) of 
the m/z ratio matching to the peptide TKPREEQYN'™STYR 
bearing respectively the glycosylation GOF, GIF, GOF-GlcNac, 
G2F and GO are shown in Figure 6A-E, respectively. All the 
detected glycopeptides have a charge state of 3. 

The glycopeptides were detected between 26.3 and 28.5 
min. The electrophoretic resolution calculated between the 
peaks corresponding to the glycopeptides bearing GOF and GIF 
gave a value of 0.88, while the resolution between the peaks of 



glycopeptides GIF and G2F gave a value of 1.08. These results 
demonstrate the separation of the glycopeptides GOF, GIF and 
G2F glycan having only one galactose difference in each case. 
The glycopeptide bearing GO could also be observed on its corre- 
sponding XIE; however, this glycopeptide co-migrated with gly- 
copeptides GOF. Moreover, the sensitivity provided by the CESI 
prototype allows detection of these glycopeptides with significant 
intensities and the signal was sufficient to generate good qual- 
ity MS/MS spectra of each glycopeptides. More generally, these 
results allowed us to accurately locate the position of the glycosyl- 
ation on Asn'"" because it is the only potential glycosylation site 
on this peptide. The nature of the glycosylation, however, had to 
be confirmed using MS/MS data. 

MS /MS spectra (Fig. 7) corresponding to the fragmentation 
of the different glycopeptides were subsequently extracted from 
the data. The precursor ion matches the mass of the glycopep- 
tide being glycosylated by GOF with a precision of 8.66 ppm. As 
expected with low energy collision induced dissociation (CID), 
product ions observed in MS/MS mainly correspond to the frag- 
mentation of the glycan moiety and the peptide backbone is usu- 
ally preserved. Study of the fragmentation spectra allowed us to 
deduce the structure of the glycan GOF. In the same manner, 
the MS /MS spectra of the glycopeptides bearing GIF, G2F and 
GO glycans were extracted from the data; in each case the mass 
measure precision on the precursor ion was inferior to 9.69 ppm. 
Similarly to the previous results, every ions corresponding to the 
different fragmentation of the glycan could be observed while the 
peptide backbone remained intact, allowing the structures of the 
studied glycans to be deduced. 

Results presented here demonstrate the possibility of using CE 
to separate and identify glycopeptides obtained from the trastu- 
zumab enzymatic digestion mainly due to the baseline separation 
efficiency of the five major glycopeptides provided by the CE. 
As an additional benefit, the sensitivity provided by the CESI 
interface and the microTOFQ-II MS allowed us to obtain the 
complete fragmentation of the glycan and thereby deduce the 
structure of the five major glycans (GOF, GIF, G2F, GOF-GlcNac 
and GO). The same data was used for MS/MS peptide map- 
ping and hot-spot characterization without requiring additional 
experiments. 

Discussion 

Peptide mapping is essential for the characterization of thera- 
peutic mAbs, which are complex glycoproteins that contain a 
plethora of micro-heterogeneities due to the production process. 
In comparison to PMF, MS/MS peptide identification is not only 
based on digested peptide m/z ratio measure, but also on the 
fragmentation of those peptides, increasing the confidence of the 
identification. Capillary zone electrophoresis (CZE) separation is 
based on the migration of species present in a background elec- 
trolyte (BGE) under an electrical field. Separation is obtained by 
differences in electrophoretic mobility among analytes depend- 
ing on size and charge of the molecule at the BGE pH. 

Peptide mapping results showed the identification of wide 
variety of peptides regarding their chemical composition (Fig. 2). 



www.landesbioscience.com 



mAbs 



7 



1,50 
1,25 
1.00 
0.75 
0.50 
0.25 
0,00 

8,0 

6,0 

4.0 

2.0 

0 
2,5 

2,0 

1,5 

1.0 

0,5 

0,0 

1,25- 

1,00- 

0,75 

0.50 

0.25 

0 
2.5 

2.0 

1.5 

1.0 

0.5 

0 



Intens.xlO'' 

26. 

A 

A 


9 

HT29-J-mC^ 
GOF 

t. .J- .1. 


Intens.xlO^ ^ 
b 


7.5 

HT29-j-»-<^ O 
GIF 

I A" 


. Intens. xio" 
- O 


9 

HT29-J-»-CX^ ■ 

GOF - GlcNac 

^, 


Intens. xio' 
U 

i.JjJll..L.......^J.^Ill>til.luUl,i -I . 1 ,lJlLli.L......iilnt..... 1 .. Al..lllnlHll 


28.0 

HT29-j-^<^^ 
G2F 

li III .ll.. jUliltj 1, J llJulL^L JU..I. J.lL., 1.... 


- Intens. xio' 

26. 

- E 1 

- — 1 — i-f ' 1 — 1 ' ■! ■! ' ; •;' -1 — 1 h-'f'" ( i" -I — 1 ■ 1 ^ 1 *i 4- T 


3 

HT29 ■ ■ QC~^ 
GO 



10 



15 



20 



25 



30 



35 



40 Time [min] 



CO 

o 

CN 

@ 



Figure 6. Extracted Ion Electropherogram of m/z ratios (A) 1039.80 (HT29 + GOF), (B) 1093.78 (HT29 + GIF), (C) 972.09 (HT29 + GO), (D) 114782 (HT29 + 
G2F), (E) 991.10 (HT29 + GO). Experimental conditions: see Material and Methods. 



This result indicates clearly that CE separation driven by peptide 
migration phenomena is able to separate and transfer success- 
fully to the MS every digested peptide regardless of its nature. 
In RP-LC, small peptides (from one to two amino acids^^) and 
more generally hydrophilic peptides, are often not detected 
because they are not retained on the column and elute in the 
void volume." The detection and identification of peptide such 
as LT09 or HT05 demonstrate the ability of CE to conserve and 
detect small peptides. The identification of the 63 amino acids 
peptide HT21 also demonstrates one of the advantages of using 
CE hyphenated to MS for peptide characterization. The presence 
of such large peptides is not surprising as trastuzumab HC does 
not have a tryptic cleavage site from amino acid 151 to 213. The 



elution and detection of such an imposing peptide may be diffi- 
cult in RPLC-MS/MS because it can irreversibly interact with the 
stationary phase. Identification of very large peptides having no 
enzymatic digestion sites or corresponding to peptides or with sev- 
eral missed cleavages is also a very important asset of this method 
because 100% sequence coverage is obtainable in various cases. 
Indeed, some digestion sites in mAbs may not be accessible to 
trypsin due to the quaternary structure of the protein, which may 
cause missed cleavages during the digestion even when guanidine 
is present as a chaotropic reagent. In addition to the complete 
sequence coverage obtained of the HC and the LC, the sensitivity 
provided by the CESI prototype supported by the high resolu- 
tion of the mass spectrometer allowed generation of high quality 
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Figure 7. MS/MS fragmentation spectra of (A) HT29 - GOF (precursor m/z 1039.78; charge state 3+), (B) HT29 - GIF (precursor Ion 1093.80; charge state 
3+), (C) HT29-G2F (precursor Ion 1147.82; charge state 3+), (D) HT29 - GOF-GlcNac (precursor Ion 972.10; 3+), (E) HT29-G0 (precursor Ion 991.09; charge 
state 3+). Experimental conditions described In Materials and Methods section. 
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MS/MS spectra. This result is explained by a remarkable ioniza- 
tion efficiency of the peptides. The low BGE flow rate during 
the separation indeed enhances the ionization process and reduces 
ion suppression.-"'^** The spectral quality is reflected by the pos- 
sibility of retracing the major part of the amino acid sequence in 
the variable domain. As the CESI-MS system is able to transfer 
efficiently an important number of ions, MS/MS spectra were of 
an excellent quality in terms of resolution and intensities (Fig. 3). 
Therefore, a significant number of y/b fragments could be attrib- 
uted, allowing retracing on the variable domain of the HC 109 
of 120 amino acids and 98 of 107 on the LC. Understanding 
the variable domain of the mAb is crucial because it is directly 
involved in the antibody-antigen binding. It is noteworthy that 
such level of characterization could be attained with only one run 
of the CESI-MS /MS system. 

Our results demonstrated the successful characterization 
of every single degradation hot spots on HC and LC and we 
observed in each case the partial modification of the protein 
because the modified and the intact form of the peptide could be 
identified (Fig. 5). In the case of the glutamine cyclization of the 
N-terminal extremity on the HC, the CE separation mechanism 
allowed the separation of the modified and intact forms of the 
peptide based on conformational changes induced by glutamine 
cyclization. Such separation is explained by the fact that even if 
the mass difference between the parent and the modified peptide 
is only 17.02 Da, Glu cyclization involves the loss of the amine 
function on its lateral chain; therefore, there is a difference of 
one charge between glutamic acid and pyroglutamic acid because 
the amine is protonated in 10% acetic acid (pH 2.2). The pos- 
sibility of separate the peptides prior to MS analysis, allows the 
identification of both forms. Indeed as the number of precursor 
ions selected for fragmentation per cycle is fairly limited on the 
microTOF-Q II, and taking into account the complexity of the 
sample, it would have been difficult for the MS to automatically 
select the m/z ratio corresponding to both forms of this peptide 
during their detection window if they had co-migrated. In the 
case of the deamidation hot spots present on Asn'' and Asn'**^ 
on the HC and Asn^" on the LC, the MS was able to fragment 
both the modified and the intact form for each peptide. Unlike 
in the glutamic acid cyclization, the XIE corresponding to those 
peptides (HT04, HT41 and LT03) showed in each case the com- 
plete co-migration between the intact peptide and its deamidated 
homolog. This is explained by the fact that the mass difference 
between asparagine and aspartic acid is only 0.98 Da. Regarding 
deamidation, the amide functional group located on asparagine 
lateral chain, is replaced by an alcohol function. Most impor- 
tantly, there is thus no significant charge difference between the 
deamidated peptide and the intact peptide because those func- 
tions are protonated at BGE pH. In this case of oxidized Met''^ 
(HT25) and Met"'' (HT48), even if the detection times were not 
exactly the same for the intact peptide and its oxidized homolo- 
gous, differences in electrophoretic mobilities could not lead to 
the baseline separation of those peptides. The sensitivity provided 
by the system allowed unambiguous characterization of each 
studied degradation hot spot, in addition to the low represented 
to PTMs and degradations present in small amounts. 



Using the CESI-MS/MS data, the complete structural char- 
acterization of the glycosylation GOF, GIF, G2F, GOF-GlcNac 
and GO could be achieved (Fig. 7). Because trypsin is not able 
to cleave glycosylation from the peptide, the glycan was observed 
on the corresponding digested peptide. The XIE corresponding 
to the glycopeptides bearing different glycans (Fig. 6) demon- 
strated the separation of several glycopeptides in a reduced time. 
This result demonstrated the ability of the CE migration mecha- 
nism to completely separate glycopeptides having a difference of 
only one galactose (e.g., between GOF and GIF). Such levels of 
separation could be explained by the fact that one more galactose 
leads to a difference between the two glycopeptides in term of 
size (+162 Da). This difference affects their relative electropho- 
retic mobilities leading to their separation. The efficiency of CE 
in the glycopeptides separation has allowed the separation to be 
obtained in -27 min. A difference of a fucose, however, did not 
lead to the separation of the glycopeptides GOF and GO, suggest- 
ing that the size of the glycopeptides is not significantly affected 
by the fucosylation at the BGE pH. 

To evaluate the robustness of this method, the analysis was 
consecutively repeated six times using the same mAb digest 
sample. Each analysis resulted in 100% sequence coverage in 
term of MS/MS peptide mapping. Similarly, the degradation 
hot spots identified were the same in each case and the five same 
major N-glycans could also be structurally characterized in the 
six analyses. Intensity ratios between the different glycopeptides 
were calculated for each analysis; the results exhibited a standard 
deviations ranging from 6 to 13%. Based on this results, glyco- 
peptide intensities appear to be related to glycan distribution. It is 
therefore it is possible to consider profiling the mAb glycan using 
this very same CESI-MS/MS method, which would enriching 
the amount of information obtain from only one analysis. 

In conclusion, this work presents the development of a rapid 
and reliable method for performing a multi-level characterization 
of trastuzumab, which was used as reference therapeutic anti- 
body in a single run. The mAb was digested using a conven- 
tional digestion protocol and the digest was analyzed by CESI 
hyphenated to a high resolution tandem MS. The CESI-MS/ 
MS fragmentation obtained in only one acquisition allowed the 
complete amino acid sequence peptide map to be recorded. A 
Mascot search engine was used for peptide identification to pro- 
vide a faster protocol. Also, the characterization of every PTM 
hot spot could be obtained within the same single run. Finally, 
separation of the major glycosylated peptides and their structural 
characterization could be obtained using the same CESI-MS/MS 
analysis data set. Such advanced characterization using only a 
single injection of 100 fmol of digested mAb peptide mixture in a 
45 min analysis demonstrates the potential of CESI-MS as a fast 
and sensitive method in mAb characterization and more gener- 
ally in bottom-up proteomic approach. 

Materials and Methods 

Materials. All chemicals used were of analytical grade or high 
purity grade. Acetic acid was provided by VWR. Bicarbonate 
ammonium, guanidine hydrochloride, iodoacetamide (lAM), 
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dithiothreitol (DTT), trifluoroacetic acid, acetonitrile (ACN) 
and trypsin enzyme were purchased from Sigma-Aldrich. Water 
used to prepare buffers and sample solutions was obtained using 
an ELGA purelab UHQ PS water purification system. Desalting 
steps were performed using Cg spin tips purchased from Protea 
Biosciences. 

Tryptic digestion procedure. Tryptic digests of antibodies 
were prepared using sequence-grade trypsin (1:10 w/w). Before 
digestion, sample was diluted to a concentration of 10 mg/mL. 
One mg of protein was mixed with 155 |JlL of denaturing reagent 
(guanidine HCl 6 M in 50 mM ammonium bicarbonate) and 15 
|xL of DTT 100 mM were added. The mixture was incubated for 
5 min at 95°C, then cooled to ambient temperature. Then, 30 |xL 
of lAM 100 mM was added and the mixture was placed in the 
dark for 20 min. Ten |xL of trypsin was added and the mixture 
was incubated at room temperature for 3 h; additional 10 |xL of 
trypsin were added to the sample for overnight digestion at room 
temperature. Buffer exchange and concentration were performed 
using a Cg spin tips. Four hundred |JlL of mAb tryptic digest in 
BGE could be obtained. A volume of ten |jlL of the obtained 
digest was used to perform all CESI-MS/MS experiments. 

Capillary electrophoresis. The CE experiments were per- 
formed with a PA 800 plus capillary electrophoresis (CE) system 
from Beckman Coulter equipped with a temperature controlled 
autosampler and a power supply able to deliver up to 30 kV. 
Hyphenation was realized using a CESI prototype made available 
by Beckman Coulter. Prototype of bare fused-silica capillaries 
(total length 100 cm; 30 |xm i.d.) with characteristic porous tip 
on its final 3 cm supplied by Beckman Coulter, a second capillary 
(total length 80 cm; 50 |JLm i.d.) filled during experiments with 
BGE allows electric contact. New capillaries were flushed for 10 
min at 75 psi (5.17 bar) with methanol, then 10 min with 0.1 
M sodium hydroxide, followed 10 min with 0.1 M hydrochloric 
acid and water for 20 min also at 75 psi. Finally, the capillary was 
flushed 10 min at 75 psi with BGEn which was acetic acid 10%. 
Hydrodynamic injection (69 mbar for 1 min) corresponding to 
a total volume of 11 nL of sample injected was used. Separations 
were performed using a voltage of +20 kV. 

Mass spectrometry. For antibody characterization, the CESI 
system was coupled to a microTOF-Q II mass spectrometer 
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